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ABSTRACT 

Objective: Low testosterone level may be a reversible 
risk factor for functional disability and deterioration in 
patients with chronic obstructive pulmonary disease 
(COPD). We sought to systematically assess the 
endogenous testosterone levels and effect of testosterone 
therapy on exercise capacity and health-related quality of 
life (HRQoL) outcomes in COPD patients, as well as to 
inform guidelines and practice. 
Design: Systematic review and meta-analysis. 
Data sources: We searched PubMed, Scopus, Cochrane 
Library, CINAHL, Health Source Nursing and PsychlNFO 
and the reference lists of retrieved articles published before 
May 201 2. 

Inclusion criteria: Observational studies on endogenous 
testosterone levels in people with chronic lung disease 
compared with controls, or randomised controlled trials 
(RCTs) on testosterone therapy for exercise capacity and/ 
or HRQoL outcomes in COPD patients were eligible. 
Data extraction and analysis: Data on the mean 
difference in endogenous total testosterone (TT) values, 
and the mean difference in exercise capacity and HRQoL 
values were extracted and pooled using random effects 
meta-analysis. 

Results: Nine observational studies in 2918 men with 
COPD reported consistently lower levels of TT compared 
with controls (weighted mean difference was -3.21 nmol/ 
L (95% CI -5.1 8 to -1 .23)). Six RCTs in 287 participants 
yielded five studies on peak muscle strength and peak 
cardiorespiratory fitness outcomes (peak oxygen uptake 
(V0 2 ) and workload) and three studies on HRQoL 
outcomes. Testosterone therapies significantly improved 
peak muscle strength (standardised mean difference 
(SMD) was 0.31 (95% CI 0.05 to 0.56)) and peak 
workload (SMD was 0.27 (95% CI 0.01 to 0.52)) 
compared with control conditions (all but one used 
placebo), but not peak V0 2 (SMD was 0.21 (95% CI 
-0.15 to 0.56)) or HRQoL (SMD was -0.03 (95% CI 
-0.32 to 0.25)). 

Conclusions: Men with COPD have clinically relevant 
lower than normal TT levels. Insufficient evidence from 
short-term studies in predominately male COPD patients 
suggests that testosterone therapy improves exercise 
capacity outcomes, namely peak muscle strength and 
peak workload. 



ARTICLE SUMMARY 



Article focus 

■ To systematically assess the mean endogenous 
testosterone level in people with chronic lung 
disease compared with controls from observa- 
tional studies. 

■ To systematically assess the effect of testoster- 
one therapy on exercise capacity and 
health-related quality of life outcomes in chronic 
obstructive pulmonary disease (COPD) patients 
from randomised controlled trials (RCTs). 

Key messages 

■ Men with COPD have clinically relevant lower 
than normal total testosterone levels compared 
with controls. 

■ Limited evidence from short-term RCT studies in 
predominately male COPD patients suggest that 
testosterone therapy improves exercise capacity 
outcomes, namely peak muscle strength and 
peak workload. 

Strengths and limitations of this study 

■ Key findings were based on a high-quality sys- 
tematic review and meta-analysis level of 
evidence. 

■ Since only a small number of studies conducted 
in specific populations were included, the find- 
ings of this review may not be relevant to other 
countries and key groups, which will require 
further research. 



INTRODUCTION 

Chronic obstructive pulmonary disease 
(COPD) is currently ranked the fifth leading 
cause of global disability (health loss). 1 
Health status or health-related quality of life 
(HRQoL) is a clinically important measure- 
ment of disability among patients with COPD 
for prognostic studies and trials. 2-4 Exercise 
capacity, one of the main determinants of 
HRQoL, is significandy impaired in COPD 
patients. 3 5 Dyspnoea and fatigue due to skel- 
etal muscle dysfunction, among other 
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physiological abnormalities, are cardinal symptoms that 
limit exercise capacity in COPD patients. 6 This is partly 
due to decreases in muscle strength and mass (often 
called 'cachexia'), since they are characteristic features 
of skeletal muscle dysfunction contributing to exercise 
intolerance and consequential deterioration in 
HRQoL. 5 6 Conversely, pulmonary rehabilitation (PR) 
including exercise (namely resistance training (RT)) 
leads to clinically relevant improvements in muscle 
strength and HRQoL, 7 8 indicating that skeletal muscle 
dysfunction should be a primary therapeutic target for 
intervention in patients with COPD. 

Since the testosterone level has been shown to be posi- 
tively associated with muscle strength and cardiorespira- 
tory fitness accounting for physical activity and muscle 
mass, 9 0 a low testosterone level may be an independent 
risk factor for functional disability and deterioration in 
COPD. For example, the levels of testosterone and other 
androgenic hormones were decreased in male and 
female COPD patients compared with controls in a few 
studies. 11-13 The potential mechanisms for this endo- 
crine dysfunction most likely involve hypoxaemia, hyper- 
capnia, systemic inflammation and the use of 
glucocorticoids. 14 

Thus, it is important to reliably establish whether the 
mean endogenous testosterone level is decreased in 
patients with COPD, because this condition is reversible 
with testosterone supplementation therapy. Indeed, a 
small but promising body of randomised controlled trial 
(RCT) evidence suggests that testosterone therapy 
improves exercise capacity and HRQoL without increas- 
ing serious adverse events. 15-17 While it is difficult to 
explain this apparent therapeutic benefit, increased 
cardiac output, 18 haemoglobin and haematocrit, 19 baror- 
eflex sensitivity 20 and exercise tolerance due to improve- 
ments in peak oxygen uptake (peak VO2) and muscle 
strength 20 are all plausible mechanisms. 

However, our initial analysis of the available published lit- 
erature indicates an absence of a systematic review of relevant 
studies on endogenous testosterone levels and testosterone 
therapy in patients with COPD. We therefore sought to sys- 
tematically review previous research to assess the mean 
endogenous testosterone level in people with chronic lung 
disease compared with controls, and the effects of testoster- 
one therapies on exercise capacity and HRQoL outcomes in 
COPD patients, to inform guidelines and practice. 

METHODS 
Search strategy 

We searched PubMed, Scopus, Cochrane Library, 
CINAHL, Health Source Nursing and the PsychlNFO 
electronic databases for articles published before May 
2012. Search syntaxes were developed in consultation 
with an experienced university research librarian taking 
into account a broad range of terms and phrases used in 
definitions of testosterone and COPD (full electronic 
search strategies for the PubMed, Scopus and Cochrane 



Library databases in online supplementary appendix 
pages 1 and 2). Reference lists of potentially eligible arti- 
cles were searched by hand to identify additional studies 
missed by our search strategy. 

Study selection 

One reviewer (EA) identified potentially relevant studies 
for inclusion by screening titles and/or abstracts of all 
citations identified with our database searches. A second 
screening was performed on the full text of these arti- 
cles. Observational studies in adult populations that 
reported endogenous testosterone levels in men and/or 
women (separately) with chronic lung disease (cases) 
compared with controls, or RCTs that reported the 
effects of testosterone treatment on exercise capacity 
and HRQoL outcomes in COPD patients were eligible. 
There were no language restrictions for articles. 

Data extraction 

Data extraction and quality assessment of included 
studies were performed and/or verified independentiy 
by three reviewers (EA, BC and SS). Discrepancies were 
resolved through discussion. Authors of relevant studies 
were contacted, where possible, for data that could not 
be extracted from the published articles. 

Quality assessment 

For methodology and quality assessment, quality check- 
lists were developed to identify potential sources of bias 
(tables in online supplementary appendix pages 3 and 
4) . Quality items for observational studies reviewed were 
(each worth 1 numerical point) as follows: (1) COPD or 
chronic lung disease was reported to have been clinically 
diagnosed or categorised according to the WHO 
International Statistical Classification of Diseases and 
Related Health Problems (ICD) system, (2) endogenous 
testosterone level was measured by radioimmunoassay or 
liquid chromatography-tandem mass spectrometry, 
(3) the study population was representative of the clin- 
ical setting or community (ie, demographic characteris- 
tics of cases and hospital controls were typical and 
community cases or controls were randomly selected) 
and (4) there was adequate adjustment or exclusion or 
matching for covariates known to be associated with 
COPD and hypogonadism in men (each worth 0.2 
numerical point) : (a) age, (b) socioeconomic or partner 
status, (c) central or general obesity, (d) smoking status, 
(e) alcohol intake, (f) physical activity, (g) depression 
or anxiety (or medications) , (h) metabolic syndrome or 
cardiovascular disease (or medications), (i) systemic 
inflammation (or glucocorticoids) and (j) sleep apnoea 
(or treatments). 

Quality items for RCT studies reviewed were (each 
worth 1.0 numerical point) as follows: (1) study eligibil- 
ity criteria were adequately described, (2) randomisation 
methodology was adequate (ie, evidence suggesting that 
the 'random' method was used to generate and imple- 
ment random allocation sequence), (3) allocation 
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concealment was adequate (ie, evidence to suggest that 
a robust method was used for concealing the sequence 
of treatment allocation (eg, independent IT or tele- 
phone service or sealed opaque envelopes only opened 
in front of the participant), (4) between-group prognos- 
tic indicators were balanced (ie, evidence showing that 
groups were similar at the outset for these prognostic 
indicators) , (5) care providers were blinded to treatment 
allocation, (6) between-group drop-out rates were 
balanced, (7) intention-to-treat analysis was included 
and (8) adverse events were reported. 

Our quality checklist scales were designed based on 
criteria for assessment of observational studies 21 and 
RCTs 21 22 and allowed summed scores to range from 0 
to 5 points and from 0 to 8 points, respectively, reflect- 
ing lowest to highest quality. Studies were considered 
'better quality' if they received a score of 3 or higher for 
observational studies and of 5 or higher for RCTs, since 
that meant that they had most of our quality items. 

Primary outcomes 

The primary outcomes were the mean difference in 
endogenous total testosterone (TT) values between the 
case and control groups for observational studies (the 
most frequently reported testosterone outcome in relevant 
studies) , and the mean difference in exercise capacity and 
HRQoL values after intervention (post-treatment) 
between the treatment and control groups for RCTs. 
Where necessary for observational studies, we estimated 
the mean and variance from the median, range and 
sample size using methods which have been shown to be 
reasonably robust in non-extreme circumstances. 23 Where 
necessary for RCTs, the post-treatment means were 
derived from the within-group changes and the control 
group SD carried forward from the baseline values. 24 
Standardised mean differences (SMDs) were calculated 
using Glass' Delta method. Exercise capacity outcomes 
included any assessment of cardiorespiratory fitness and 
peripheral skeletal muscle strength. Where multiple car- 
diorespiratory fitness outcomes were reported, we first 
chose peak VO2 measures and then prioritised peak work- 
load (power output) laboratory assessments of cardio- 
respiratory fitness over field tests. Where multiple muscle 
strength outcomes were reported, we prioritised peak iso- 
metric measures over peak dynamic measures and knee 
extension over other joint movements. HRQoL outcomes 
included any patient-reported assessment of health status 
or functional disability. Where multiple HRQoL out- 
comes/scales were reported, we first chose summed score 
scales, and then prioritised subscales that measure 
'fatigue' symptoms, and the most frequently reported 
HRQoL outcome in the other studies reviewed. 

Data synthesis 

Three reviewers (EA, BC and SS) independently collated 
and/or verified extracted data to present a descriptive 
synthesis of important study characteristics and a quanti- 
tative synthesis of effect estimates. 



Secondary outcomes 

The secondary outcomes were data about adverse events 
reported in RCTs for a descriptive synthesis. 

Statistical methods 

We pooled and weighted studies first using random 
effects meta-analysis models, and second using fixed 
effects models for verification. 25 Where necessary, we 
standardised laboratory values for endogenous TT levels 
between observational studies using the International 
System of Units (SI units), expressed in nanomoles per 
litre (nmol/L). These studies were then pooled to esti- 
mate the inverse variance weighted mean difference 
(WMD), including the DerSimonian and Laird 95% CI, 
between cases and controls. Where papers presented 
medians without means, we estimated the missing mean 
as being equal to the median for meta-analysis. 23 

In examining the effects of testosterone treatment on 
exercise capacity and HRQoL outcomes, the SMD from 
each RCT was pooled to produce an overall estimate of 
effect, and associated 95% CI, between the treatment 
and control groups. For each meta-analysis model, the 
degree of heterogeneity in SMDs was assessed by visual 
inspection, the I 2 statistic (moderate being <50% 26 ) and 
the % 2 test of goodness of fit. 27 Where evidence of het- 
erogeneity was observed, we checked data extracted 
from individual outlier studies, qualitatively investigated 
reasons for their different results and explored the 
effects of study exclusion in sensitivity analyses. 

We also used sensitivity analysis to investigate the 
robustness of the meta-analyses models. We variously 
excluded RCTs in men and women, placebo only 
(rather than placebo with exercise) controlled trials, 
longer duration trials (>12 weeks) and studies of lower 
quality (score <3 for observational studies; score <5 for 
RCTs). And we repeated the meta-analysis models using 
different cardiorespiratory fitness outcomes. Publication 
bias, which reflects the tendency for smaller studies to 
be published in the literature only when the findings are 
positive, was assessed visually using funnel plots. 28 All cal- 
culations were performed in Stata V.12 (StataCorp, 
College Station, Texas, USA) using the 'metan' and 
'metafunnel' commands. A two-tailed p value <0.05 was 
considered statistically significant throughout the 
analyses. 

RESULTS 

Figure 1 presents a flow chart summarising the identifi- 
cation of potentially relevant studies, as well as those 
included and excluded (see online supplementary 
appendix page 8). Our search strategy identified 906 
citations after the duplicates were removed. Of these, 
865 citations were excluded after the first screening of 
tides and/or abstracts for inclusion and exclusion cri- 
teria, leaving 41 citations for a second full text screen- 
ing. Hand searching the reference lists of these articles 
identified two additional potentially relevant citations. 
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906 Citations identified from literature search of electronic 
databases (22 duplicates removed) 



865 Citations excluded based on 

screening titles and/ or abstracts for 
inclusion and exclusion criteria 



41 Potentially relevant 
citations for inclusion 



2 Citations identified 
from reference lists 



28 Citations excluded 

14 Inadequate article, predictor or outcome 

9 No control group 

2 Not empirical study 

1 Male-female data not reported separately 

2 Redundant citation (duplicate publication) 



15 Citations included for review 

9 Observational studies on COPD -> endogenous testosterone level (N=2918) 
6 RCTs on testosterone treatment -> exercise capacity and/orHRQoL (N=287) 



Figure 1 Flow chart summarising identification of studies included for review. 



After further assessment of these 43 citations, 28 were 
excluded for reasons listed in figure 1, leaving 15 for 
final inclusion in the systematic review. Most studies were 
excluded for inadequate predictor or outcome variables, 
or for not having a control group (list of excluded cita- 
tions; see online supplementary appendix pages 5-7) . 

Descriptive data synthesis 

Table 1 presents the study characteristics of nine obser- 
vational studies included for review, which were pub- 
lished between 1981 and 2011. Studies were conducted 
in Scotland, 29 80 Sweden, 31 the USA, 32 Taiwan, 33 
Greece, 12 Turkey, 13 Norway 34 and Belgium. 35 The 
degree of severity of airflow limitation in COPD cases 
ranged from mild-to-very severe, assessed according to 
the Global Initiative for Chronic Obstructive Lung 
Disease criteria 3 *' in four studies, 12 13 33 35 and by various 

r 29 30 32 34 r n c • • 

spirometry criteria m four of the five remaining 

studies. Control participants were recruited from 

• ... "12 29 SO 32 33 35 , 

primary care settings in six studies. ' The 
sample sizes ranged from 16 to 213, resulting in a total 
of 2918 participants across studies. The mean age of the 
samples ranged from 50 to 71 years. All the observa- 
tional studies were conducted in men. The mean quality 



scores ranged from 2.2 to 4, and four studies received a 

r o u- u 12 13 32 34 

score of 5 or higher. 

Table 2 presents the study characteristics of six RCTs 
included for review, which were published between 2003 

37 

and 2011. Studies were conducted in the USA,' The 
Netherlands, 38 Brazil, 39 France, 16 Canada 40 and Norway. 41 
Major inclusion criteria were stable COPD or chronic 
respiratory failure in all studies, various spirometry criteria 
in all but one study, 16 low TT in only one study 37 and low 
body mass index in only two studies. 16 39 Major exclusion 
criteria were a range of chronic conditions in all studies, 
prostatic conditions in four studies 16 37 39 40 and elevated 
haemoglobin in one study. 37 The sample sizes ranged 
from 16 to 122, resulting in a total of 287 participants 
across studies. The mean age of the samples ranged from 
66 to 69 years. All but two studies 16 40 were conducted in 
men only. The baseline mean TT levels ranged from 9.6 to 
21.6nmol/L for men, and from 0.42 to 0.45 nmol/L for 
women as reported in one study. 16 The testosterone therap- 
ies used were oral testosterone undecanoate in one study, 16 
oral stanozolol after a baseline intramuscular injection of 
testosterone in another study 39 and intramuscular injec- 
tions (testosterone enanthate 37 41 and nandrolone decano- 
ate 38 ) in all the remaining studies. Four studies investigated 
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Table 1 Characteristics of observational studies reviewed 



- 



Participants 



Mean (SD) or median* 
(range) total 
testosterone (nmol/L) 



Study 

identification 



Cases, 
Country assessment 



Controls 



Sample 
size 



Sex 



Mean 

age 

(years) 



Cases 



Controls 



Covariate 
considerations 
(adjusted/excluded/ 
matched) 



Quality 
score 
(out of 5) 



Bratel et a/ 31 



Gow et aP 



Iqbal era/ 32 



Sweden 



Scotland 



USA 



Hsu et aP 



Kaparianos 
era/ 12 



Taiwan 



Greece 



Karadag era/ 1 



Turkey 



COPD with severe 
airway obstruction 
and daytime 
hypoxaemia, not 
reported 

COPD, spirometry 
FEVt <40% and 
FVC <65% 
predicted 
Chronic lung 
disease, 
predominantly 
spirometry FEV,/ 
FVC <80% 
predicted 



Chronic bronchitis 
and COPD, GOLD 
criteria stage 1-4 



COPD, GOLD 
criteria mean FEV! 
54%, mean FEV,/ 
FVC 59% 



COPD, GOLD 
criteria stage 2-3 



Age-matched 32 

'healthy' 

participants 



Inpatients ready for 26 
discharge 



Primary care clinic 85 
patients without 
history of chronic 
lung disease or 
corticosteroid 
treatments 



Outpatients with 213 
stable urolithiasis or 
prostatitis 



Outpatient smokers 125 



M 



69 



M 70 



M 62 



14.3 6.9 17.9 6.9 Age 



10.7 3.0- 11.0 1.8- Age, thyroid disease, oral 
19.5 21.9 corticosteroids 



11.1 9.8 14.0 8.6 



M 71 



M 61 



14.7 7.7 15.3 6.4 



11.2 4.4 18.4 4.5 



Age-matched 
participants 



125 



M 63 



13.2 5.5 16.6 5.5 



Age, ethnicity, BMI, 
physical activity, smoking 
status, caffeine and 
alcohol consumption, 
thyroid and rheumatologic 
conditions, medications 
including glucocorticoids, 
testosterone, and for 
osteoporosis 
Age, chronic diseases 
including treated benign 
prostate hyperplasia, 
other chronic lung 
disease, exacerbation 
Age, ethnicity, BMI, 
smoking, chronic 
diseases, endothelin-1 
pro-inflammatory allele, 
other chronic lung 
diseases, medications 
including p2-adrenergic 
agonists, corticosteroids, 
follicle stimulating 
hormone, erythrocyte 
sedimentation rate 
Age, sexual partner 
status, BMI, medications 



1.2 



2.9 



4.0 



2.9 



3.5 



3.0 



Continued 



Table 1 Continued 




Participants 



Mean (SD) or median* 
(range) total 
testosterone (nmol/L) 




Study 

identification 



Cases, 
Country assessment 



Controls 



Mean 
Sample age 
size Sex (years) 



Cases 



Controls 



Covariate 
considerations 
(adjusted/excluded/ 
matched) 



Quality 
score 
(out of 5) 




Semple et at 30 Scotland 



Svartberg ef a/ 34 Norway 




Van Vliet et aP 5 Belgium 



COPD, spirometry 
FEVt and FEVy 
FVC <70% 
Representative 
population with 
COPD, spirometry 
FEVt <50% 
predicted with 
FEVVFVC <70% 
predicted 
COPD, GOLD 
criteria stage 1 -4 



Age-matched 
inpatients 

Representative 
population with 
spirometry FEV^ 
>50% predicted 



Outpatients with 
normal lung 
function 



16 



2197 



M 50 



M 66 



13.1 4.4 20.3 5.4 



12.7 5.3 14.0 5.5 



99 



M 65 



9.0* 



6.8- 
12.9 



12.3* 



8.8- 
16.2 



that interfere with sex 
hormones, chronic 
diseases, treated 
urogenital disease, aged 
>75 years, regular 
systemic corticosteroids, 
oestradiol, tumour 
necrosis factor-a 
Age 



Age, waist circumference, 
smoking status 




Age, BMI, calculated low 
free testosterone, sex 
hormone binding globulin 



3.6 



2.9 



BMI, body mass index; BTS, British Thoracic Society; COPD, chronic obstructive pulmonary disease; FEV^ forced 
Chronic Obstructive Lung Disease; M, men; matched, considered if between-group difference in characteristic was 



expiratory volume in1 s; FVC, forced vital capacity; GOLD, Global Initiative for 
not likely statistically significant (p<0.05). 



Table 2 Characteristics of randomised controlled trials reviewed 



Population 




Sample 

Study identification Country size 



Major inclusion criteria 



Major exclusion criteria 



Baseline mean 
total 

testosterone 
(nmol/L) 



Sex (M/W) Mean age (years) Treated Controls 



Casaburi et aF USA 



47 



Creutzberg et aF The Netherlands 56 



Ferreira et aF Brazil 



3 ison era/ 16 



France 



17 



122 



Sharma etaF 



Canada 



16 



Svartberg et a/* 1 Norway 



29 



Stable COPD, spirometry FEVt 
<60% predicted and FEV^C 
<60%; total testosterone 
<13.9 nmol/L 

Stable COPD, ATS criteria, 
spirometry FEVi <70% predicted 
and increase in FEVi <10% 
postbronchodilation 
Ambulatory and stable COPD, 
spirometry maximal inspiratory 
pressure <60% predicted and 
BMI <20 kg/m 2 
Stable CRF, >1 8 years, Pa0 2 
<8 kPa, long-term oxygen 
therapy and/or home mechanical 
ventilation >3 months, BMI 
<21 kg/m 2 or fat-free mass index 
<25th centile 



Stable COPD, GOLD criteria 
stage 3-4, spirometry FEVt 
<50% predicted and FEV^FVC 
<0.7 



Stable COPD, moderate to 
severe, spirometry FEV-i <60% 
predicted 



CVD, low or high bodyweight, M 67 

prostatic indications, 

haemoglobin >16 g/dL, 

orthopaedic impairments 

Obesity, malignancies, CVD, M 66 

gastrointestinal inflammatory 

disorders, type 1 diabetes, 

oxygen dependency at rest 

CVD, prostatic indications M 69 



Pulmonary hypertension, M/W 66 

sleep apnoea, prostatic 

indications, neuromuscular 

diseases, cystic fibrosis, 

conditions compromising 

6-month survival, 

hormone-dependent cancer, 

women of childbearing age, 

elevated aminotransferase 

History of asthma, obesity, M/W 68 

malignancy, CVD, prostatic 

indications, renal, hepatic, 

gastrointestinal or endocrine 

disease, recent surgery 

<2 months 

Asthma, malignancies, CVD, M 66 
hepatic or endocrine disease 



(a) (a) 10.5; 
10.5; (b)9.6 

(b) 14.1 



13.4 



14.4 



M 12.7; 
W0.45 



21.6 



14.6 



17.2 



M 13.6; 
W0.42 



MNR WNR 



20.5 



Continued 



o 

T3 
<D 

> 

O 
O 
CD 
(A 
(A 



Table 2 Continued 



Treatments 



Control conditions 



Trial 

duration 

(weeks) 



Outcomes (assessments, units) 



Quality 
score (out 
of 8) 



(a) Testosterone enanthate, 100 mg/ 
week IM; (b) Testosterone enanthate, 
100 mg/week IM with RT 
Nandrolone decanoate, 50 mg/ 
2 weeks IM 

Testosterone, 250 mg IM at baseline 
and oral stanozolol, 12 mg/day with 
PR 

Oral testosterone undecanoate, M 
80 mg/W 40 mg twice daily with PR 

Nandrolone decanoate, M 50 mg/W 
25 mg/2 weeks IM with PR 



Testosterone enanthate, 250 mg/ 
4 weeks IM 



(a) Placebo; (b) Placebo with RT 10 



Placebo 



Placebo with PR 9-27 weeks 27 



Home education on 13 
self-management of 
COPD-related stress and anxiety 
Placebo with PR 16 



Placebo 



26 



Muscle strength (leg press, 1RM kg); cardiorespiratory fitness 
(bicycle, peak V0 2 L/min and peak workload Watts) 

Muscle strength (knee extension, peak isometric force 
Newtons); cardiorespiratory fitness (bicycle, peak workload 
Watts and peak V0 2 mL/min); HRQoL (SGRQ total score) 
Cardiorespiratory fitness (bicycle, peak V0 2 % predicted; 
6MWT, distance metres) 

Muscle strength (knee extension, peak isometric force 
Newtons); cardiorespiratory fitness (6MWT, distance metres; 
bicycle, peak workload Watts); HRQoL (CRQ total score) 
Muscle strength (knee extension, peak isometric force units 
NR); cardiorespiratory fitness (bicycle, peak V0 2 % predicted 
and peak workload Watts; 6MWT, distance metres); HRQoL 
(CRQ fatigue subscore) 

Cardiorespiratory fitness (6MWT, distance metres) 



5.0 



1RM, one repetition maximum; 6MWT, 6 min walking test; ATS, American Thoracic Society; BMI, body mass index; COPD, chronic obstructive pulmonary disease; CRF, chronic respiratory 
failure; CRQ, chronic respiratory questionnaire; CVD, cardiovascular diseases; FEVi, forced expiratory volume in 1 s; FVC, forced vital capacity; GOLD, Global Initiative for Chronic Obstructive 
Lung Disease; HRQoL, health-related quality of life; IM, intramuscular injection; M, men; NR, not reported; PR, pulmonary rehabilitation; RT, resistance training; V0 2 , volume of oxygen uptake; 
W, women. 
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Study 

Identification 



Bratel et al, 2000 
Gow et al, 1987 ' 
Iqbal et al, 1999 
Hsuet al, 2006 

Kaparianos et al, 2011 — 
Karadag et al, 2007 

Sempleet al, 1981 

Svartberg et al, 2007 
Van Vliet et al, 2005 
Overall (l-squared = 81.9%, p = 0.000) 



i — 

-15 



<t> 



~i r - 

-10 -5 
COPD unfavourable 



WMD (95% CI) 



% 
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Figure 2 Weighted mean difference in the endogenous total testosterone level between the case and control groups for 
observational studies. 



the combined effects of testosterone therapy with RT or 
PR. 16 39 40 All but one study 16 used placebo control condi- 
tions. Trial durations ranged from 8 to 27 weeks. Primary 
outcomes were peak muscle strength in five studies (from 
four citations 16 38 40 ), peak V0 2 in five studies (from 
four citations 37-40 ), peak workload in five studies 
(from four citations 16 38 *") , 6 min walking test (6MWT) 
in four studies 16 39 ^ 1 and HRQoL in three studies. 16 38 40 
The mean quality scores ranged from 4.5 to 6, and all but 
one study 40 received a score of 5 or higher. 

Quantitative data synthesis 

Effect of COPD exposure on an endogenous TT level 

Figure 2 presents WMD on an endogenous TT level 
between the case and control groups for observational 
studies (see online supplementary appendix page 9). 
Men with COPD had significantiy lower levels of TT com- 
pared with controls (pooled WMD was -3.21 nmol/L 



(—5.18 to —1.23)). There was a high degree of heterogen- 
eity between studies (I 2 =81.9%, p<0.001) that was mostiy 
a result of variation in the degree of deference rather 
than an unfavourable direction towards the null. The sen- 
sitivity analyses presented in table 3 show that the pooled 
WMD was substantially changed after exclusion of lower 
quality studies (increased to -3.68 (—7.00 to —0.36)) and 
one large sample size study 34 (increased to -3.56 (—5.63 
to —1.49)). Finally, for the one study 33 which provided 
unadjusted mean differences and mean differences 
adjusted for age, waist circumference and smoking status, 
a model using unadjusted rather than adjusted values 
decreased the pooled WMD to -2.95 (-4.63 to -1.27). 
In addition, a funnel plot was produced which showed 
only slight evidence of publication bias, since WMD in 
TT was small (-0.60 33 and -1.10 nmol/L 34 ) for two of 
the largest studies (figure 3; see online supplementary 
appendix page 10) . 



Table 3 Sensitivity analysis of observational studies on COPD exposure -»• total testosterone outcome meta-analysis 






N 

studies 


N 

sample 


Total testosterone 
WMD 


(95% CI) 




p Value for 
heterogeneity 


Fixed effects model 


9 


2918 


-3.00 


(-3.75 to 


-2.26) 


<0.001 


Exclusion of five lower quality studies 


4 


2532 


-3.68 


(-7.00 to 


-0.36) 


<0.001 


(score <3) 














Model using unadjusted rather than 


9 


2918 


-2.95 


(-4.63 to 


-1 .27) 


<0.001 


adjusted values in one study 














Exclusion of a large sample size study 


8 


721 


-3.56 


(-5.63 to 


-1.49) 


<0.001 


COPD, chronic obstructive pulmonary disease; N, number; WMD, weighted mean difference. 
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Funnel plot with pseudo 95% confidence limits 
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Figure 3 Funnel plot assessing symmetry of the weighted 
mean difference in the total testosterone level between the 
case and control groups for observational studies. 

Effect of testosterone therapy on exercise capacity and 
HRQoL outcomes 

Figure 4 presents SMD in peak muscle strength out- 
comes after testosterone therapy between the treatment 
and control groups for RCTs (see online supplementary 
appendix page 11). Testosterone therapies significandy 
improved standardised peak muscle strength outcomes 
compared with control conditions (pooled SMD was 
0.31 (0.05 to 0.56)), and there was little evidence of stat- 
istical heterogeneity between studies (I 2 =0%, p=0.839). 
The sensitivity analyses presented in table 4 shows that 
the pooled SMD was similar after exclusion of one lower 
quality study 40 (0.31 (0.04 to 0.57)), but was substantially 



changed after exclusion of two placebo only controlled 
studies (no longer statistically significant 0.30 (—0.01 to 
0.62) ) , and the two studies in men and women that were 
also the two longer duration studies 16 40 (decreased to 
0.21 (—0.18 to 0.60)). In addition, a funnel plot was pro- 
duced which showed only slight evidence of publication 
bias, since SMD in peak muscle strength outcomes was 
consistent in all but one treatment arm in one study 
(figure 5; see online supplementary appendix page 12). 

Figure 6 presents SMD in peak VO2 outcomes after 
testosterone therapy between the treatment and control 
groups for RCTs (see online supplementary appendix 
page 13). Testosterone therapies consistently failed to 
show significant improvements in standardised peak 
VO2 outcomes compared with control conditions 
(pooled SMD was 0.21 (-0.15 to 0.56); I 2 =4.8%, 
p=0.379) . The sensitivity analyses presented in table 5 
shows that this null effect was similar after exclusion of 
one lower quality study 40 (0.13 (-0.27 to 0.54)), two 
placebo only controlled studies (0.03 (—0.60 to 0.66)), 
one study in men and women 40 (0.13 (—0.27 to 0.54)), 
and two longer duration studies 39 40 (0.27 (—0.12 to 
0.67)), and in the model using 6MWT outcomes (0.10 
(—0.34 to 0.53)). Conversely, testosterone therapies sig- 
nificantly improved cardiorespiratory fitness in the 
model using peak workload rather than peak VO2 out- 
comes (pooled SMD was 0.27 (0.01 to 0.52)), and there 
was little evidence of statistical heterogeneity between 
studies (I 2 =0%, p=0. 741). 

Figure 7 presents SMD in peak HRQoL outcomes 
after testosterone therapy between the treatment and 
control groups for RCTs (see online supplementary 



Study 

identification 



Casaburi et al. 2004a 

Casaburi et al. 2004b 

Creutzberg et al. 2003 

Pisonet ad, 201 1 

Sharma et al, 2008 

Overall (l-squared = 0.0%, p = 0.839) 



4> 



-2 -1 
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SMD (95% Q) Weight 



0.33 (-0.48, 1.15) 9.96 
-0.15 (-0.97, 0.67) 9.76 
0.31 (-022, 0.84) 23.36 
0.39(0.03,0.75) 50.23 
0.29 (-0.70, 128) 6.69 
0.31 (0.05,0.56) 100.00 
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Figure 4 Standardised mean difference in peak muscle strength outcomes after testosterone therapy between the treatment 
and control groups for randomised controlled trials. 
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Table 4 Sensitivity analysis of randomised controlled trials on testosterone treatment -> muscle strength outcomes 
meta-analysis 





N studies 


N sample 


SMD 


(95% CI) 


p Value for 
heterogeneity 


Fixed effects model 


5 


241 


0.31 


(0.05 to 0.56) 


0.839 


Exclusion of one lower quality study (score <5.0) 


4 


225 


0.31 


(0.04 to 0.57) 


0.699 


Exclusion of two placebo only control studies 


3 


161 


0.30 


(-0.01 to 0.62) 


0.491 


Exclusion of two studies in men and women 


3 


103 


0.21 


(-0.18 to 0.60) 


0.611 


Exclusion of two longer duration studies (>1 2 weeks) 


3 


103 


0.21 


(-0.18 to 0.60) 


0.611 



N, number; SMD, standardised mean difference. 



appendix page 14). Testosterone therapies consistently 
failed to show better standardised HRQpL outcomes 
compared with control conditions (pooled SMD was - 
0.03 (-0.32 to 0.25); I 2 =0%, p=0.934). The sensitivity 
analyses showed that this null effect was comparable in 
the fixed effects model (-0.03 (-0.32 to 0.25)) and 



after exclusion 
(-0.34 to 0.25)) 



of one lower quality study (—0.04 



Adverse events 

Two RCTs showed that testosterone therapy was asso- 
ciated with more serious adverse events compared with 
the control group. One study reported an increased 
number of exacerbations during short-term, but not long- 
term, follow-up, 16 and another study reported that two of 
three COPD patients with respiratory failure in the treat- 
ment group had died. 38 Conversely, one study reported 
that more patients died of respiratory failure in the 
control group. 39 Four studies showed that testosterone 
therapies decreased gonadotrophin levels compared with 
controls, as can be expected. 16 3 39 41 Compared with 
controls, testosterone therapy was associated with a 
decrease in the sex hormone-binding globulin level in 
two studies, 16 41 and a decrease in the oestradiol level in 



Funnel plot with pseudo 95% confidence limits 




SMD 



Figure 5 Funnel plot assessing symmetry of standardised 
mean difference in peak muscle strength outcomes after 
testosterone treatment between the treatment and control 
groups for randomised controlled trials. 



412 



men in another study. Finally, few studies showed that 
testosterone therapy was associated with relative increases 

16 3*7 38 

in haemoglobin or haematocrit; ' creatinine, aspar- 
tate aminotransferase and lactate dehydrogenase 
values. 38 



DISCUSSION 
Summary of evidence 

We have established that men with COPD have signifi- 
candy lower levels of endogenous TT compared with 
controls (WMD was -3.21 nmol/L (-5.18 to -1.23)). 
The size of the mean difference in TT level, which ranks 
men with COPD in the second quartile (below average) 
compared with age-matched population norms, 9 is likely 
to be clinically relevant. For instance, comparable or 
greater differences in TT levels between cases and con- 
trols have been reported in studies on risk of type 2 dia- 
betes (WMD was -2.66 nmol/L (-3.45 to -1.86)), " 
metabolic syndrome (WMD was —2.64 nmol/L (—2.95 to 
—2.32)) 43 and clinically significant depression (median 
difference was -1.21 nmol/L, p<0.001 for Mann-Whitney 
test). 44 These comorbidities have been shown to 
adversely affect COPD prognosis, 45-47 and would further 
complicate COPD management. As the effect of COPD 
exposure on TT level increased in size after exclusion of 
lower quality studies and one large sample size study, 
future higher quality studies will most likely strengthen 
rather than weaken this evidence base. Collectively, our 
results and the existing literature indicate that testoster- 
one deficiency should be considered in men with 
COPD. 

On the basis of the limited short-term RCT evidence in 
predominately male COPD patients, our results suggest 
that testosterone therapy significantly improves several 
exercise capacity outcomes. The size of the effect of testos- 
terone therapy that can be expected in practice is small to 
moderate, but comparable to exercise or PR therapies 
alone. 7 8 The effect of testosterone therapy on standar- 
dised muscle strength outcomes remained robust after 
exclusion of one lower quality study, but weakened after 
exclusion of two placebo only studies. This supports the 
hypothesis that testosterone therapy with exercise is more 
effective than testosterone therapy alone for functional 
improvements. 48 In addition, our results suggest that the 
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Study 

Identification 

Casaburi et al. 2004a 
Casaburi et al. 2004b 
Creutzberg et al. 2003 

Ferreira et al. 1998 

Sharma et al. 2008 

Overall (l-squared = 4.8%. p = 0.379) 



SMD (95% CI) Weight 

0.03 (-0.77, 0.83) 18.94 

0.06 (-0.76, 0.87) 18.14 

0.48 (-0.06, 1.02) 39.43 
-0.58 (-1.61, 0.44) 11.86 

0.61 (-0.42, 1.65) 11.62 

0.21 (-0.15, 0.56) 100.00 



Favours control 

■4 



Favours treatment 
► 



Figure 6 Standardised mean difference in peak V0 2 outcomes after testosterone therapy between the treatment and control 
groups for randomised controlled trials. 



mechanism for improvement in cardiorespiratory fitness 
assessed by peak workload is most likely explained by 
better exercise tolerance due to testosterone-induced 
increases in muscle strength rather than changes in VO2. 

Limitations 

Several limitations require careful consideration. Since 
only a small number of studies conducted in specific 
populations were included, the findings of this review 
may not be relevant to other countries and key groups, 
which will require further research. In particular, most 
of the RCTs were conducted in COPD patients without 
cardiovascular disease and/or diabetes or endocrine 
disease, which are highly prevalent in this population 
group. 45 Second, we replaced missing data points with 
estimates in some instances, which introduced further 



uncertainty. This includes estimating the mean from the 
median and range and carrying forward the preinterven- 
tion SD of control groups where the postintervention 
statistic was not available. Third, because only a few 
RTCs targeted COPD patients who would have theoretic- 
ally benefited most from testosterone therapy such as 
those with low testosterone or body weight, ' our 
estimated effect size for improvement in standardised 
exercise capacity may have been underestimated. Finally, 
the reviewer-level limitations include incomplete 
retrieval of information for several of the 28 citations 
excluded, as well as the existence of other relevant 
studies not identified with our search strategy, resulting 
in selection bias. However, the results and conclusions 
reported in most of the excluded studies were in line 
with those reported here, and selection bias was unlikely. 



Table 5 Sensitivity analysis of randomised controlled trials on testosterone treatment 
meta-analysis 



cardiorespiratory fitness outcomes 





N 

studies 


N 

sample 


SMD (95% CI) 


p Value for 
heterogeneity 




Fixed effects model 


5 


136 


0.21 


(-0.13 to 0.56) 


0.379 




Exclusion of one lower quality study (score <5) 


4 


120 


0.13 


(-0.27 to 0.54) 


0.315 




Exclusion of two placebo only control studies 


3 


56 


0.03 


(-0.60 to 0.66) 


0.269 




Exclusion of one study in men and women 


4 


120 


0.13 


(-0.27 to 0.54) 


0.315 




Exclusion of two longer duration study 


3 


103 


0.27 


(-0.12 to 0.67) 


0.553 




(>1 2 weeks) 














Model using peak workload rather than peak V0 2 


5 


241 


0.27 


(0.01 to 0.52) 


0.741 




outcomes 














Model using 6MWT rather than peak V0 2 


4 


184 


0.10 


(-0.34 to 0.53) 


0.210 





outcomes 



6MWT, 6 min walking test; N, number; SMD, standardised mean difference. 
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Study 

identification 



Creutzberg et al. 2003 
Pi son etal. 2011 
Sharma et al, 2008 



Overall (l-squared =0.0%. p =0.934) 



% 



SMD(95%Q) Weight 



-0.11 (-0.64,0.41) 28.81 
-0.01 (-0.37, 0.34) 62.94 
0.06 (-0.92, 1.05) 825 
-0.03 (-0.32, 0.25) 100.00 



-2-1012 
Favours treatment Favours control 
■4 ► 

Figure 7 Standardised mean difference in peak health-related quality of life outcomes after testosterone therapy between the 
treatment and control groups for randomised controlled trials. 



Nevertheless, our systematic analysis of the existing lit- 
erature revealed that there is an absence of sufficient 
RCT evidence to draw firm conclusions about the long- 
term benefits and risks of testosterone therapies for 
exercise capacity and HRQoL outcomes in male or 
female COPD patients, or about the pharmacological 
dosing for specific testosterone therapies needed for 
effectiveness. Reliable information on the efficacy and 
safety, as well as cost-effectiveness, of specific testosterone 
therapies is required to inform clinical practice guide- 
lines for COPD. In addition, future high-quality epi- 
demiological research is needed to determine which 
subgroups of COPD patients are most vulnerable to tes- 
tosterone deficiency, and to reliably establish whether 
women with COPD likewise present with significandy 
lower levels of TT than controls. 



CONCLUSIONS 

Men with COPD have clinically relevant lower than 
normal endogenous TT levels, and we believe that our 
meta-analytic results are sufficientiy reliable to recom- 
mend that clinicians should consider testosterone defi- 
ciency in these patients. Although our results also 
suggest that testosterone therapy improves several exer- 
cise capacity outcomes, there is an absence of sufficient 
RCT evidence to draw firm conclusions about the long- 
term benefits and risks of testosterone therapy for exer- 
cise capacity and HRQoL outcomes in male or female 
COPD patients. 
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